Overview
Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 8693 |
| Missing cells | 2324 |
| Missing cells (%) | 1.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 891.5 KiB |
| Average record size in memory | 105.0 B |
Variable types
| Text | 3 |
|---|---|
| Categorical | 2 |
| Boolean | 3 |
| Numeric | 6 |
FoodCourt is highly overall correlated with VRDeck | High correlation |
VRDeck is highly overall correlated with FoodCourt | High correlation |
VIP is highly imbalanced (84.0%) | Imbalance |
HomePlanet has 201 (2.3%) missing values | Missing |
CryoSleep has 217 (2.5%) missing values | Missing |
Cabin has 199 (2.3%) missing values | Missing |
Destination has 182 (2.1%) missing values | Missing |
Age has 179 (2.1%) missing values | Missing |
VIP has 203 (2.3%) missing values | Missing |
RoomService has 181 (2.1%) missing values | Missing |
FoodCourt has 183 (2.1%) missing values | Missing |
ShoppingMall has 208 (2.4%) missing values | Missing |
Spa has 183 (2.1%) missing values | Missing |
VRDeck has 188 (2.2%) missing values | Missing |
Name has 200 (2.3%) missing values | Missing |
PassengerId has unique values | Unique |
Age has 178 (2.0%) zeros | Zeros |
RoomService has 5577 (64.2%) zeros | Zeros |
FoodCourt has 5456 (62.8%) zeros | Zeros |
ShoppingMall has 5587 (64.3%) zeros | Zeros |
Spa has 5324 (61.2%) zeros | Zeros |
VRDeck has 5495 (63.2%) zeros | Zeros |
Reproduction
| Analysis started | 2025-12-25 18:26:31.661497 |
|---|---|
| Analysis finished | 2025-12-25 18:26:37.367600 |
| Duration | 5.71 seconds |
| Software version | ydata-profiling vv4.18.0 |
| Download configuration | config.json |
Variables
PassengerId
Text
Unique
| Distinct | 8693 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 68.0 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 8693 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 0001_01 |
|---|---|
| 2nd row | 0002_01 |
| 3rd row | 0003_01 |
| 4th row | 0003_02 |
| 5th row | 0004_01 |
| Value | Count | Frequency (%) |
| 0005_01 | 1 | < 0.1% |
| 9280_02 | 1 | < 0.1% |
| 0001_01 | 1 | < 0.1% |
| 0002_01 | 1 | < 0.1% |
| 0003_01 | 1 | < 0.1% |
| 9264_01 | 1 | < 0.1% |
| 9267_01 | 1 | < 0.1% |
| 9267_02 | 1 | < 0.1% |
| 9268_01 | 1 | < 0.1% |
| 9270_01 | 1 | < 0.1% |
| Other values (8683) | 8683 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 12459 | |
| 1 | 9827 | |
| _ | 8693 | |
| 2 | 5017 | |
| 3 | 4039 | 6.6% |
| 4 | 3790 | 6.2% |
| 6 | 3664 | 6.0% |
| 5 | 3606 | 5.9% |
| 8 | 3557 | 5.8% |
| 7 | 3410 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 60851 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 12459 | |
| 1 | 9827 | |
| _ | 8693 | |
| 2 | 5017 | |
| 3 | 4039 | 6.6% |
| 4 | 3790 | 6.2% |
| 6 | 3664 | 6.0% |
| 5 | 3606 | 5.9% |
| 8 | 3557 | 5.8% |
| 7 | 3410 | 5.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 60851 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 12459 | |
| 1 | 9827 | |
| _ | 8693 | |
| 2 | 5017 | |
| 3 | 4039 | 6.6% |
| 4 | 3790 | 6.2% |
| 6 | 3664 | 6.0% |
| 5 | 3606 | 5.9% |
| 8 | 3557 | 5.8% |
| 7 | 3410 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 60851 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 12459 | |
| 1 | 9827 | |
| _ | 8693 | |
| 2 | 5017 | |
| 3 | 4039 | 6.6% |
| 4 | 3790 | 6.2% |
| 6 | 3664 | 6.0% |
| 5 | 3606 | 5.9% |
| 8 | 3557 | 5.8% |
| 7 | 3410 | 5.6% |
HomePlanet
Categorical
Missing
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 201 |
| Missing (%) | 2.3% |
| Memory size | 68.0 KiB |
| Earth | |
|---|---|
| Europa | |
| Mars |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.0438059 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Europa |
|---|---|
| 2nd row | Earth |
| 3rd row | Europa |
| 4th row | Europa |
| 5th row | Earth |
Common Values
| Value | Count | Frequency (%) |
| Earth | 4602 | |
| Europa | 2131 | |
| Mars | 1759 | 20.2% |
| (Missing) | 201 | 2.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| earth | 4602 | |
| europa | 2131 | |
| mars | 1759 | 20.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8492 | |
| r | 8492 | |
| E | 6733 | |
| t | 4602 | |
| h | 4602 | |
| u | 2131 | 5.0% |
| o | 2131 | 5.0% |
| p | 2131 | 5.0% |
| M | 1759 | 4.1% |
| s | 1759 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 42832 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 8492 | |
| r | 8492 | |
| E | 6733 | |
| t | 4602 | |
| h | 4602 | |
| u | 2131 | 5.0% |
| o | 2131 | 5.0% |
| p | 2131 | 5.0% |
| M | 1759 | 4.1% |
| s | 1759 | 4.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 42832 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 8492 | |
| r | 8492 | |
| E | 6733 | |
| t | 4602 | |
| h | 4602 | |
| u | 2131 | 5.0% |
| o | 2131 | 5.0% |
| p | 2131 | 5.0% |
| M | 1759 | 4.1% |
| s | 1759 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 42832 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 8492 | |
| r | 8492 | |
| E | 6733 | |
| t | 4602 | |
| h | 4602 | |
| u | 2131 | 5.0% |
| o | 2131 | 5.0% |
| p | 2131 | 5.0% |
| M | 1759 | 4.1% |
| s | 1759 | 4.1% |
CryoSleep
Boolean
Missing
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 217 |
| Missing (%) | 2.5% |
| Memory size | 68.0 KiB |
| False | |
|---|---|
| True | |
| (Missing) | 217 |
| Value | Count | Frequency (%) |
| False | 5439 | |
| True | 3037 | |
| (Missing) | 217 | 2.5% |
Cabin
Text
Missing
| Distinct | 6560 |
|---|---|
| Distinct (%) | 77.2% |
| Missing | 199 |
| Missing (%) | 2.3% |
| Memory size | 68.0 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.0775842 |
| Min length | 5 |
Unique
| Unique | 5427 ? |
|---|---|
| Unique (%) | 63.9% |
Sample
| 1st row | B/0/P |
|---|---|
| 2nd row | F/0/S |
| 3rd row | A/0/S |
| 4th row | A/0/S |
| 5th row | F/1/S |
| Value | Count | Frequency (%) |
| g/734/s | 8 | 0.1% |
| c/137/s | 7 | 0.1% |
| g/1476/s | 7 | 0.1% |
| b/11/s | 7 | 0.1% |
| f/1194/p | 7 | 0.1% |
| b/82/s | 7 | 0.1% |
| d/176/s | 7 | 0.1% |
| g/981/s | 7 | 0.1% |
| e/13/s | 7 | 0.1% |
| f/1411/p | 7 | 0.1% |
| Other values (6550) | 8423 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 16988 | |
| 1 | 5326 | 8.9% |
| S | 4288 | 7.1% |
| P | 4206 | 7.0% |
| 2 | 3078 | 5.1% |
| F | 2794 | 4.6% |
| 3 | 2601 | 4.3% |
| G | 2559 | 4.3% |
| 4 | 2393 | 4.0% |
| 5 | 2377 | 4.0% |
| Other values (11) | 13507 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 60117 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| / | 16988 | |
| 1 | 5326 | 8.9% |
| S | 4288 | 7.1% |
| P | 4206 | 7.0% |
| 2 | 3078 | 5.1% |
| F | 2794 | 4.6% |
| 3 | 2601 | 4.3% |
| G | 2559 | 4.3% |
| 4 | 2393 | 4.0% |
| 5 | 2377 | 4.0% |
| Other values (11) | 13507 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 60117 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| / | 16988 | |
| 1 | 5326 | 8.9% |
| S | 4288 | 7.1% |
| P | 4206 | 7.0% |
| 2 | 3078 | 5.1% |
| F | 2794 | 4.6% |
| 3 | 2601 | 4.3% |
| G | 2559 | 4.3% |
| 4 | 2393 | 4.0% |
| 5 | 2377 | 4.0% |
| Other values (11) | 13507 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 60117 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| / | 16988 | |
| 1 | 5326 | 8.9% |
| S | 4288 | 7.1% |
| P | 4206 | 7.0% |
| 2 | 3078 | 5.1% |
| F | 2794 | 4.6% |
| 3 | 2601 | 4.3% |
| G | 2559 | 4.3% |
| 4 | 2393 | 4.0% |
| 5 | 2377 | 4.0% |
| Other values (11) | 13507 |
Destination
Categorical
Missing
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 182 |
| Missing (%) | 2.1% |
| Memory size | 68.0 KiB |
| TRAPPIST-1e | |
|---|---|
| 55 Cancri e | |
| PSO J318.5-22 |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 11.187052 |
| Min length | 11 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | TRAPPIST-1e |
|---|---|
| 2nd row | TRAPPIST-1e |
| 3rd row | TRAPPIST-1e |
| 4th row | TRAPPIST-1e |
| 5th row | TRAPPIST-1e |
Common Values
| Value | Count | Frequency (%) |
| TRAPPIST-1e | 5915 | |
| 55 Cancri e | 1800 | 20.7% |
| PSO J318.5-22 | 796 | 9.2% |
| (Missing) | 182 | 2.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| trappist-1e | 5915 | |
| 55 | 1800 | 13.9% |
| cancri | 1800 | 13.9% |
| e | 1800 | 13.9% |
| pso | 796 | 6.2% |
| j318.5-22 | 796 | 6.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 12626 | |
| T | 11830 | |
| e | 7715 | 8.1% |
| - | 6711 | 7.0% |
| S | 6711 | 7.0% |
| 1 | 6711 | 7.0% |
| R | 5915 | 6.2% |
| I | 5915 | 6.2% |
| A | 5915 | 6.2% |
| 5 | 4396 | 4.6% |
| Other values (13) | 20768 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 95213 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| P | 12626 | |
| T | 11830 | |
| e | 7715 | 8.1% |
| - | 6711 | 7.0% |
| S | 6711 | 7.0% |
| 1 | 6711 | 7.0% |
| R | 5915 | 6.2% |
| I | 5915 | 6.2% |
| A | 5915 | 6.2% |
| 5 | 4396 | 4.6% |
| Other values (13) | 20768 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 95213 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| P | 12626 | |
| T | 11830 | |
| e | 7715 | 8.1% |
| - | 6711 | 7.0% |
| S | 6711 | 7.0% |
| 1 | 6711 | 7.0% |
| R | 5915 | 6.2% |
| I | 5915 | 6.2% |
| A | 5915 | 6.2% |
| 5 | 4396 | 4.6% |
| Other values (13) | 20768 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 95213 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| P | 12626 | |
| T | 11830 | |
| e | 7715 | 8.1% |
| - | 6711 | 7.0% |
| S | 6711 | 7.0% |
| 1 | 6711 | 7.0% |
| R | 5915 | 6.2% |
| I | 5915 | 6.2% |
| A | 5915 | 6.2% |
| 5 | 4396 | 4.6% |
| Other values (13) | 20768 |
Age
Real number (ℝ)
Missing Zeros
| Distinct | 80 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 179 |
| Missing (%) | 2.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.82793 |
| Minimum | 0 |
|---|---|
| Maximum | 79 |
| Zeros | 178 |
| Zeros (%) | 2.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 19 |
| median | 27 |
| Q3 | 38 |
| 95-th percentile | 56 |
| Maximum | 79 |
| Range | 79 |
| Interquartile range (IQR) | 19 |
Descriptive statistics
| Standard deviation | 14.489021 |
|---|---|
| Coefficient of variation (CV) | 0.50260359 |
| Kurtosis | 0.10193292 |
| Mean | 28.82793 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 0.41909658 |
| Sum | 245441 |
| Variance | 209.93174 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24 | 324 | 3.7% |
| 18 | 320 | 3.7% |
| 21 | 311 | 3.6% |
| 19 | 293 | 3.4% |
| 23 | 292 | 3.4% |
| 22 | 291 | 3.3% |
| 20 | 277 | 3.2% |
| 26 | 268 | 3.1% |
| 28 | 267 | 3.1% |
| 27 | 259 | 3.0% |
| Other values (70) | 5612 |
| Value | Count | Frequency (%) |
| 0 | 178 | |
| 1 | 67 | 0.8% |
| 2 | 75 | |
| 3 | 75 | |
| 4 | 71 | 0.8% |
| 5 | 33 | 0.4% |
| 6 | 40 | 0.5% |
| 7 | 52 | 0.6% |
| 8 | 46 | 0.5% |
| 9 | 42 | 0.5% |
| Value | Count | Frequency (%) |
| 79 | 3 | < 0.1% |
| 78 | 3 | < 0.1% |
| 77 | 2 | < 0.1% |
| 76 | 2 | < 0.1% |
| 75 | 4 | |
| 74 | 5 | |
| 73 | 7 | |
| 72 | 4 | |
| 71 | 7 | |
| 70 | 9 |
VIP
Boolean
Imbalance Missing
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 203 |
| Missing (%) | 2.3% |
| Memory size | 68.0 KiB |
| False | |
|---|---|
| True | 199 |
| (Missing) | 203 |
| Value | Count | Frequency (%) |
| False | 8291 | |
| True | 199 | 2.3% |
| (Missing) | 203 | 2.3% |
RoomService
Real number (ℝ)
Missing Zeros
| Distinct | 1273 |
|---|---|
| Distinct (%) | 15.0% |
| Missing | 181 |
| Missing (%) | 2.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 224.68762 |
| Minimum | 0 |
|---|---|
| Maximum | 14327 |
| Zeros | 5577 |
| Zeros (%) | 64.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 47 |
| 95-th percentile | 1274.25 |
| Maximum | 14327 |
| Range | 14327 |
| Interquartile range (IQR) | 47 |
Descriptive statistics
| Standard deviation | 666.71766 |
|---|---|
| Coefficient of variation (CV) | 2.9673093 |
| Kurtosis | 65.273802 |
| Mean | 224.68762 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.3330141 |
| Sum | 1912541 |
| Variance | 444512.44 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5577 | |
| 1 | 117 | 1.3% |
| 2 | 79 | 0.9% |
| 3 | 61 | 0.7% |
| 4 | 47 | 0.5% |
| 5 | 28 | 0.3% |
| 9 | 25 | 0.3% |
| 8 | 24 | 0.3% |
| 6 | 24 | 0.3% |
| 14 | 21 | 0.2% |
| Other values (1263) | 2509 | |
| (Missing) | 181 | 2.1% |
| Value | Count | Frequency (%) |
| 0 | 5577 | |
| 1 | 117 | 1.3% |
| 2 | 79 | 0.9% |
| 3 | 61 | 0.7% |
| 4 | 47 | 0.5% |
| 5 | 28 | 0.3% |
| 6 | 24 | 0.3% |
| 7 | 17 | 0.2% |
| 8 | 24 | 0.3% |
| 9 | 25 | 0.3% |
| Value | Count | Frequency (%) |
| 14327 | 1 | |
| 9920 | 1 | |
| 8586 | 1 | |
| 8243 | 1 | |
| 8209 | 1 | |
| 8168 | 1 | |
| 8151 | 1 | |
| 8142 | 1 | |
| 8030 | 1 | |
| 7406 | 1 |
FoodCourt
Real number (ℝ)
High correlation Missing Zeros
| Distinct | 1507 |
|---|---|
| Distinct (%) | 17.7% |
| Missing | 183 |
| Missing (%) | 2.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 458.0772 |
| Minimum | 0 |
|---|---|
| Maximum | 29813 |
| Zeros | 5456 |
| Zeros (%) | 62.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 76 |
| 95-th percentile | 2748.5 |
| Maximum | 29813 |
| Range | 29813 |
| Interquartile range (IQR) | 76 |
Descriptive statistics
| Standard deviation | 1611.4892 |
|---|---|
| Coefficient of variation (CV) | 3.5179425 |
| Kurtosis | 73.30723 |
| Mean | 458.0772 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.1022279 |
| Sum | 3898237 |
| Variance | 2596897.6 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5456 | |
| 1 | 116 | 1.3% |
| 2 | 75 | 0.9% |
| 3 | 53 | 0.6% |
| 4 | 53 | 0.6% |
| 5 | 33 | 0.4% |
| 6 | 31 | 0.4% |
| 9 | 28 | 0.3% |
| 10 | 27 | 0.3% |
| 7 | 27 | 0.3% |
| Other values (1497) | 2611 | |
| (Missing) | 183 | 2.1% |
| Value | Count | Frequency (%) |
| 0 | 5456 | |
| 1 | 116 | 1.3% |
| 2 | 75 | 0.9% |
| 3 | 53 | 0.6% |
| 4 | 53 | 0.6% |
| 5 | 33 | 0.4% |
| 6 | 31 | 0.4% |
| 7 | 27 | 0.3% |
| 8 | 20 | 0.2% |
| 9 | 28 | 0.3% |
| Value | Count | Frequency (%) |
| 29813 | 1 | |
| 27723 | 1 | |
| 27071 | 1 | |
| 26830 | 1 | |
| 21066 | 1 | |
| 18481 | 1 | |
| 17958 | 1 | |
| 17901 | 1 | |
| 17687 | 1 | |
| 17432 | 1 |
ShoppingMall
Real number (ℝ)
Missing Zeros
| Distinct | 1115 |
|---|---|
| Distinct (%) | 13.1% |
| Missing | 208 |
| Missing (%) | 2.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 173.72917 |
| Minimum | 0 |
|---|---|
| Maximum | 23492 |
| Zeros | 5587 |
| Zeros (%) | 64.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 27 |
| 95-th percentile | 927.8 |
| Maximum | 23492 |
| Range | 23492 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 604.69646 |
|---|---|
| Coefficient of variation (CV) | 3.4806847 |
| Kurtosis | 328.87091 |
| Mean | 173.72917 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 12.627562 |
| Sum | 1474092 |
| Variance | 365657.81 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5587 | |
| 1 | 153 | 1.8% |
| 2 | 80 | 0.9% |
| 3 | 59 | 0.7% |
| 4 | 45 | 0.5% |
| 5 | 38 | 0.4% |
| 7 | 36 | 0.4% |
| 6 | 34 | 0.4% |
| 13 | 29 | 0.3% |
| 9 | 28 | 0.3% |
| Other values (1105) | 2396 | |
| (Missing) | 208 | 2.4% |
| Value | Count | Frequency (%) |
| 0 | 5587 | |
| 1 | 153 | 1.8% |
| 2 | 80 | 0.9% |
| 3 | 59 | 0.7% |
| 4 | 45 | 0.5% |
| 5 | 38 | 0.4% |
| 6 | 34 | 0.4% |
| 7 | 36 | 0.4% |
| 8 | 28 | 0.3% |
| 9 | 28 | 0.3% |
| Value | Count | Frequency (%) |
| 23492 | 1 | |
| 12253 | 1 | |
| 10705 | 1 | |
| 10424 | 1 | |
| 9058 | 1 | |
| 7810 | 1 | |
| 7185 | 1 | |
| 7148 | 1 | |
| 7104 | 1 | |
| 6805 | 1 |
Spa
Real number (ℝ)
Missing Zeros
| Distinct | 1327 |
|---|---|
| Distinct (%) | 15.6% |
| Missing | 183 |
| Missing (%) | 2.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 311.13878 |
| Minimum | 0 |
|---|---|
| Maximum | 22408 |
| Zeros | 5324 |
| Zeros (%) | 61.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 59 |
| 95-th percentile | 1607.1 |
| Maximum | 22408 |
| Range | 22408 |
| Interquartile range (IQR) | 59 |
Descriptive statistics
| Standard deviation | 1136.7055 |
|---|---|
| Coefficient of variation (CV) | 3.6533715 |
| Kurtosis | 81.20211 |
| Mean | 311.13878 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.6360199 |
| Sum | 2647791 |
| Variance | 1292099.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5324 | |
| 1 | 146 | 1.7% |
| 2 | 105 | 1.2% |
| 3 | 53 | 0.6% |
| 5 | 53 | 0.6% |
| 4 | 46 | 0.5% |
| 7 | 34 | 0.4% |
| 6 | 33 | 0.4% |
| 9 | 28 | 0.3% |
| 8 | 28 | 0.3% |
| Other values (1317) | 2660 | |
| (Missing) | 183 | 2.1% |
| Value | Count | Frequency (%) |
| 0 | 5324 | |
| 1 | 146 | 1.7% |
| 2 | 105 | 1.2% |
| 3 | 53 | 0.6% |
| 4 | 46 | 0.5% |
| 5 | 53 | 0.6% |
| 6 | 33 | 0.4% |
| 7 | 34 | 0.4% |
| 8 | 28 | 0.3% |
| 9 | 28 | 0.3% |
| Value | Count | Frequency (%) |
| 22408 | 1 | |
| 18572 | 1 | |
| 16594 | 1 | |
| 16139 | 1 | |
| 15586 | 1 | |
| 15331 | 1 | |
| 15238 | 1 | |
| 14970 | 1 | |
| 13995 | 1 | |
| 13902 | 1 |
VRDeck
Real number (ℝ)
High correlation Missing Zeros
| Distinct | 1306 |
|---|---|
| Distinct (%) | 15.4% |
| Missing | 188 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 304.85479 |
| Minimum | 0 |
|---|---|
| Maximum | 24133 |
| Zeros | 5495 |
| Zeros (%) | 63.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 46 |
| 95-th percentile | 1534.2 |
| Maximum | 24133 |
| Range | 24133 |
| Interquartile range (IQR) | 46 |
Descriptive statistics
| Standard deviation | 1145.7172 |
|---|---|
| Coefficient of variation (CV) | 3.7582391 |
| Kurtosis | 86.011186 |
| Mean | 304.85479 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.8197316 |
| Sum | 2592790 |
| Variance | 1312667.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5495 | |
| 1 | 139 | 1.6% |
| 2 | 70 | 0.8% |
| 3 | 56 | 0.6% |
| 5 | 51 | 0.6% |
| 4 | 47 | 0.5% |
| 6 | 32 | 0.4% |
| 8 | 30 | 0.3% |
| 7 | 29 | 0.3% |
| 9 | 25 | 0.3% |
| Other values (1296) | 2531 | |
| (Missing) | 188 | 2.2% |
| Value | Count | Frequency (%) |
| 0 | 5495 | |
| 1 | 139 | 1.6% |
| 2 | 70 | 0.8% |
| 3 | 56 | 0.6% |
| 4 | 47 | 0.5% |
| 5 | 51 | 0.6% |
| 6 | 32 | 0.4% |
| 7 | 29 | 0.3% |
| 8 | 30 | 0.3% |
| 9 | 25 | 0.3% |
| Value | Count | Frequency (%) |
| 24133 | 1 | |
| 20336 | 1 | |
| 17306 | 1 | |
| 17074 | 1 | |
| 16337 | 1 | |
| 14485 | 1 | |
| 12708 | 1 | |
| 12685 | 1 | |
| 12682 | 1 | |
| 12424 | 1 |
Name
Text
Missing
| Distinct | 8473 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 200 |
| Missing (%) | 2.3% |
| Memory size | 68.0 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 15 |
| Mean length | 13.833628 |
| Min length | 7 |
Unique
| Unique | 8453 ? |
|---|---|
| Unique (%) | 99.5% |
Sample
| 1st row | Maham Ofracculy |
|---|---|
| 2nd row | Juanna Vines |
| 3rd row | Altark Susent |
| 4th row | Solam Susent |
| 5th row | Willy Santantines |
| Value | Count | Frequency (%) |
| willy | 20 | 0.1% |
| casonston | 18 | 0.1% |
| oneiles | 16 | 0.1% |
| domington | 15 | 0.1% |
| litthews | 15 | 0.1% |
| cartez | 14 | 0.1% |
| garnes | 14 | 0.1% |
| fulloydez | 14 | 0.1% |
| browlerson | 14 | 0.1% |
| distured | 13 | 0.1% |
| Other values (4880) | 16833 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 12691 | 10.8% |
| a | 10251 | 8.7% |
| n | 9155 | 7.8% |
| 8493 | 7.2% | |
| r | 7707 | 6.6% |
| o | 6563 | 5.6% |
| i | 6456 | 5.5% |
| l | 6231 | 5.3% |
| s | 5299 | 4.5% |
| t | 4552 | 3.9% |
| Other values (43) | 40091 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 117489 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 12691 | 10.8% |
| a | 10251 | 8.7% |
| n | 9155 | 7.8% |
| 8493 | 7.2% | |
| r | 7707 | 6.6% |
| o | 6563 | 5.6% |
| i | 6456 | 5.5% |
| l | 6231 | 5.3% |
| s | 5299 | 4.5% |
| t | 4552 | 3.9% |
| Other values (43) | 40091 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 117489 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 12691 | 10.8% |
| a | 10251 | 8.7% |
| n | 9155 | 7.8% |
| 8493 | 7.2% | |
| r | 7707 | 6.6% |
| o | 6563 | 5.6% |
| i | 6456 | 5.5% |
| l | 6231 | 5.3% |
| s | 5299 | 4.5% |
| t | 4552 | 3.9% |
| Other values (43) | 40091 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 117489 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 12691 | 10.8% |
| a | 10251 | 8.7% |
| n | 9155 | 7.8% |
| 8493 | 7.2% | |
| r | 7707 | 6.6% |
| o | 6563 | 5.6% |
| i | 6456 | 5.5% |
| l | 6231 | 5.3% |
| s | 5299 | 4.5% |
| t | 4552 | 3.9% |
| Other values (43) | 40091 |
| Value | Count | Frequency (%) |
| True | 4378 | |
| False | 4315 |
Interactions
Correlations
| Age | CryoSleep | Destination | FoodCourt | HomePlanet | RoomService | ShoppingMall | Spa | Transported | VIP | VRDeck | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Age | 1.000 | 0.112 | 0.041 | 0.208 | 0.201 | 0.123 | 0.103 | 0.197 | 0.134 | 0.118 | 0.181 |
| CryoSleep | 0.112 | 1.000 | 0.119 | 0.161 | 0.118 | 0.153 | 0.070 | 0.140 | 0.468 | 0.080 | 0.127 |
| Destination | 0.041 | 0.119 | 1.000 | 0.092 | 0.262 | 0.035 | 0.009 | 0.068 | 0.111 | 0.043 | 0.062 |
| FoodCourt | 0.208 | 0.161 | 0.092 | 1.000 | 0.262 | 0.185 | 0.187 | 0.486 | 0.060 | 0.133 | 0.511 |
| HomePlanet | 0.201 | 0.118 | 0.262 | 0.262 | 1.000 | 0.150 | 0.054 | 0.190 | 0.195 | 0.177 | 0.197 |
| RoomService | 0.123 | 0.153 | 0.035 | 0.185 | 0.150 | 1.000 | 0.443 | 0.249 | 0.162 | 0.054 | 0.182 |
| ShoppingMall | 0.103 | 0.070 | 0.009 | 0.187 | 0.054 | 0.443 | 1.000 | 0.257 | 0.039 | 0.000 | 0.194 |
| Spa | 0.197 | 0.140 | 0.068 | 0.486 | 0.190 | 0.249 | 0.257 | 1.000 | 0.175 | 0.044 | 0.448 |
| Transported | 0.134 | 0.468 | 0.111 | 0.060 | 0.195 | 0.162 | 0.039 | 0.175 | 1.000 | 0.035 | 0.155 |
| VIP | 0.118 | 0.080 | 0.043 | 0.133 | 0.177 | 0.054 | 0.000 | 0.044 | 0.035 | 1.000 | 0.120 |
| VRDeck | 0.181 | 0.127 | 0.062 | 0.511 | 0.197 | 0.182 | 0.194 | 0.448 | 0.155 | 0.120 | 1.000 |
Missing values
Sample
| PassengerId | HomePlanet | CryoSleep | Cabin | Destination | Age | VIP | RoomService | FoodCourt | ShoppingMall | Spa | VRDeck | Name | Transported | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0001_01 | Europa | False | B/0/P | TRAPPIST-1e | 39.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Maham Ofracculy | False |
| 1 | 0002_01 | Earth | False | F/0/S | TRAPPIST-1e | 24.0 | False | 109.0 | 9.0 | 25.0 | 549.0 | 44.0 | Juanna Vines | True |
| 2 | 0003_01 | Europa | False | A/0/S | TRAPPIST-1e | 58.0 | True | 43.0 | 3576.0 | 0.0 | 6715.0 | 49.0 | Altark Susent | False |
| 3 | 0003_02 | Europa | False | A/0/S | TRAPPIST-1e | 33.0 | False | 0.0 | 1283.0 | 371.0 | 3329.0 | 193.0 | Solam Susent | False |
| 4 | 0004_01 | Earth | False | F/1/S | TRAPPIST-1e | 16.0 | False | 303.0 | 70.0 | 151.0 | 565.0 | 2.0 | Willy Santantines | True |
| 5 | 0005_01 | Earth | False | F/0/P | PSO J318.5-22 | 44.0 | False | 0.0 | 483.0 | 0.0 | 291.0 | 0.0 | Sandie Hinetthews | True |
| 6 | 0006_01 | Earth | False | F/2/S | TRAPPIST-1e | 26.0 | False | 42.0 | 1539.0 | 3.0 | 0.0 | 0.0 | Billex Jacostaffey | True |
| 7 | 0006_02 | Earth | True | G/0/S | TRAPPIST-1e | 28.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | NaN | Candra Jacostaffey | True |
| 8 | 0007_01 | Earth | False | F/3/S | TRAPPIST-1e | 35.0 | False | 0.0 | 785.0 | 17.0 | 216.0 | 0.0 | Andona Beston | True |
| 9 | 0008_01 | Europa | True | B/1/P | 55 Cancri e | 14.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Erraiam Flatic | True |
| PassengerId | HomePlanet | CryoSleep | Cabin | Destination | Age | VIP | RoomService | FoodCourt | ShoppingMall | Spa | VRDeck | Name | Transported | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 8683 | 9272_02 | Earth | False | F/1894/P | TRAPPIST-1e | 21.0 | False | 86.0 | 3.0 | 149.0 | 208.0 | 329.0 | Gordo Simson | False |
| 8684 | 9274_01 | NaN | True | G/1508/P | TRAPPIST-1e | 23.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Chelsa Bullisey | True |
| 8685 | 9275_01 | Europa | False | A/97/P | TRAPPIST-1e | 0.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Polaton Conable | True |
| 8686 | 9275_02 | Europa | False | A/97/P | TRAPPIST-1e | 32.0 | False | 1.0 | 1146.0 | 0.0 | 50.0 | 34.0 | Diram Conable | False |
| 8687 | 9275_03 | Europa | NaN | A/97/P | TRAPPIST-1e | 30.0 | False | 0.0 | 3208.0 | 0.0 | 2.0 | 330.0 | Atlasym Conable | True |
| 8688 | 9276_01 | Europa | False | A/98/P | 55 Cancri e | 41.0 | True | 0.0 | 6819.0 | 0.0 | 1643.0 | 74.0 | Gravior Noxnuther | False |
| 8689 | 9278_01 | Earth | True | G/1499/S | PSO J318.5-22 | 18.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Kurta Mondalley | False |
| 8690 | 9279_01 | Earth | False | G/1500/S | TRAPPIST-1e | 26.0 | False | 0.0 | 0.0 | 1872.0 | 1.0 | 0.0 | Fayey Connon | True |
| 8691 | 9280_01 | Europa | False | E/608/S | 55 Cancri e | 32.0 | False | 0.0 | 1049.0 | 0.0 | 353.0 | 3235.0 | Celeon Hontichre | False |
| 8692 | 9280_02 | Europa | False | E/608/S | TRAPPIST-1e | 44.0 | False | 126.0 | 4688.0 | 0.0 | 0.0 | 12.0 | Propsh Hontichre | True |